BIOTECHN' irY 
SYSTEM^ 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: CR.^ 7 

Source: t)/^ 
Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 




FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: natin21help@usDto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(a>,uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check N sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the TISPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



4 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENCLISII "ALPHA" HEADERS. WHICH WERE INSERTED BY PTO SOFTWARE 



1 Wrapped Nuclcics 

Wrapped Aminos 



The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



3 J Misali, 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



10 



11 



gncd Amino 
Numbering 

Non-ASCII 



5 Variable Length 



_Patcntln 2.0 
"bug" 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCIl(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII tcxL 

Scqucncc(s) contain n*s or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

. A "bug" in Patcntln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

scqucnccs(s) . Normally, Patcntln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



7 Skipped Sequences Scqucncc(s) 

(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ rD NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ CD NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



8 Skipped Sequences Sequencers) 

(NEW RULES) 



missing. If intentional, please insert the following lines for each skipped sequence. 



<2 10> sequence id number 
<400> sequence id number 
000 

9 Use of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's arc present. 

In <220> to <223> section, please explain location of n or Xaa, and wluch residue n or Xaa represents. 



Jnvalid <213> 
Response 

Use of<220> 



Per 1.823 of Sequence Rules, the only valid <213> responses arc: Unknown, Artificial Sequence, or 
scientific name (Gcnus/spccics). <220>-<223> section is required when <2 1 3> response is Unknown or 
is Artificial Sequence 



Sequencers) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



12 Pataitln 2.0 

"bug" 



Use of <220> to <223> is MANDATORY if <2 1 3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register " 06/01 /! 998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patcntln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



AMC - Biotechnology Systems Branch - 06/04/2001 
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Page 1 of 18 



OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/881,752 



DATE: 07/05/2001 
TIME: 15:54:39 



Input Set : A:\06132.041001 sequence listirig.txt 
Output Set: N:\CRF3\07032001\I881752.raw 



(ii) 



C— > 3 (1) 

5 

C--> 7 

8 

Helicobacter 
9 

11 (iii) 

13 (iv) 

14 

15 

16 

17 

18 

19 

21 (v) 

22 

23 

24 

25 

27 (vi) 
C--> 28 
C--> 29 

30 

32 (viii) 

33 

34 

35 

37 (ix) 

38 

39 

40 



SEQUENCE LISTING 
GENERAL INFORMATION: 

(i) APPLICANT: Harold Kleanthous et al . 

TITLE OF INVENTION: Identification of Polynucleotides 

Encoding Novel Helicobacter Polypeptides in the 



Does Not Comply 
Corrected Diskette Needec 



Genome 
NUMBER OF SEQUENCES: 370 
CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 
COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 
CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/881,752 

(B) FILING DATE: 18-Jun-2001 

(C) CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

(A) NAME: Clark, Paul T . 

(B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE/DOCKET NUMBER: 06132/041001 
TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-428-0200 

(B) TELEFAX: 617-428-7045 

(C) TELEX: 



2.0 



ERRORED SEQUENCES 



1361 (2) 

1363 

1364 

1365 

1366 

1367 

1369 

1370 

1372 

1373 

1374 

1376 



INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 




(Xi) 



Coding Sequence 
60. . .2039 
(D) OTHER INFORMATION: 
SEQUENCE DESCRIPTION: SEQ ID NO: 



21: 



file://C:\CRF3\Outhold\VsrI881752.htm 



7/5/01 
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Page 2 of 18 



RAW SEQUENCE LISTING DATE: 07/05/2001 

PATENT APPLICATION: US/09/881,752 TIME: 15:54:39 

Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 

1378 CCCTATCATA GGGCGTGGCA TGAAGAAAAA AGCAAAAGTC TTTTGGTATT GTTTTAATC 59 

1381 ATG ATT TAT TGG TTG TAT TTG GCG GTC TTT TTT TTG TTG AGC GCA TTA 107 

1382 Met He Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser Ala Leu 

1383 1 5 10 15 

1385 GAC GCT AAA GAA ATC GCT ATG CAA CGA TTT GAC AAA CAA AAC CAT AAG 155 

1386 Asp Ala Lys Glu He Ala Met Gin Arg Phe Asp Lys Gin Asn His Lys 

1387 20 25 30 

13 89 ATT TTT GAA ATC CTT GCG GAT AAA GTG AGC GCT AAA GAC AAT GTG ATA 203 

1390 He Phe Glu He Leu Ala Asp Lys Val Ser Ala Lys Asp Asn Val He 

1391 35 40 45 

1393 ACC GCA TCA GGG AAT GCG ATC TTA TTG AAT TAT GAT GTG TAT ATT CTA 251 

1394 Thr Ala Ser Gly Asn Ala He Leu Leu Asn Tyr Asp Val Tyr He Leu 

1395 50 55 60 

1397 GCG GAC AAG GTG CGT TAT GAC ACT AAA ACC AAA GAA GCG TTA TTA GAG 299 

13 98 Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu Leu Glu 
1399 65 70 75 80 

1401 GGG AAT ATC AAG GTT TAT AGG GGC GAG GGT TTG CTC GTT AAA ACC GAT 347 

1402 Gly Asn He Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys Thr Asp 

1403 85 90 95 

1405 TAC GTG AAA TTG AGT TTG AAT GAA AAA TAT GAA ATC ATT TTC CCC TTT 395 

14 06 Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu He He Phe Pro Phe 
1407 100 105 110 

14 09 TAT GTC CAA GAC AGC GTG AGC GGG ATT TGG GTG AGC GCG GAT ATT GCC 443 

1410 Tyr Val Gin Asp Ser Val Ser Gly He Trp Val Ser Ala Asp He Ala 

14H 115 120 125 

1413 AGC GGA AAG GAT CAA AAA TAT AAG GTT AAA AAC ATG AGC ACT TCA GGG 4 91 

1414 Ser Gly Lys Asp Gin Lys Tyr Lys Val Lys Asn Met Ser Thr Ser Gly 

1415 130 135 140 

1417 TGC AGC ATT GAT AAC CCC ATT TGG CAT GTC AAT GCG ACT TCA GGC TCA 539 

1418 Cys Ser He Asp Asn Pro He Trp His Val Asn Ala Thr Ser Gly Ser 

1419 145 150 155 160 

14 21 TTC AAC ATG CAA AAA TCG CAT TTG TCT ATG TGG AAT CCT AAG ATC TAT 587 

14 22 Phe Asn Met Gin Lys Ser His Leu Ser Met Trp Asn Pro Lys' He Tyr 

1423 165 170 175 

1425 GTC GGT GAT ATT CCT GTA TTG TAT TTG CCC TAT ATT TTC ATG TCC ACG 635 

1426 Val Gly Asp He Pro Val Leu Tyr Leu Pro Tyr He Phe Met Ser Thr 

1427 180 185 190 

14 29 AGC AAT AAA AGA ACT ACT GGG TTT TTA TAC CCT GAG TTT GGC ACT TCC 683 

14 30 Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly Thr Ser 

!431 195 200 205 

1433 AAC TTA GAC GGC TTT ATT TAT TTG CAA CCC TTT TAT TTA GCC CCC AAA 731 

14 34 Asn Leu Asp Gly Phe He Tyr Leu Gin Pro Phe Tyr Leu Ala Pro Lys 

1435 210 215 220 

14 37 AAC TCA TGG GAT ATG ACC TTT ACC CCA CAA ATC CGC TAT AAA AGG GGT 779 

1438 Asn Ser Trp Asp Met Thr Phe Thr Pro Gin He Arg Tyr Lys Arg Gly 

1439 225 230 235 240 

1441 TTT GGC TTG AAT TTT GAA GCG CGC TAC ATT AAC TCT AAA AAC GAC AGG 827 

1442 Phe Gly Leu Asn Phe Glu Ala Arg Tyr He Asn Ser Lys Asn Asp Arg 

14 43 245 250 255 



file://C:\CRF3\Outhold\VsrI881752.htm 



7/5/01 



RAW SEQUENCE LISTING DATE: 07/05/2001 

PATENT APPLICATION: US/09/881, 752 TIME: 15:54:39 

Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 



1445 


TTT 


TTA 


TTC 


AAC 


GCG 


CGC 


TAT 


TTT 


AGG 


AAT 


TAC 


ACC 


CAA 


TAT 


GTC 


AAA 


875 


1446 


Phe 


Leu 


Phe 


Asn 


Ala 


Arg 


Tyr 


Phe 


Arg 


Asn 


Tyr 


Thr 


Gin 


Tyr 


Val 


Lys 




1447 








260 










265 










270 








1449 


CGC 


TAC 


GAT 


TTG 


AGG 


AAT 


CAA 


AAT 


ATC 


TAC 


GGG 


TTT 


GAA 


TTT 


TTA 


AGC 


923 


1450 


Arg 


Tyr 


Asp 


Leu 


Arg 


Asn 


Gin 


Asn 


He 


Tyr 


Gly 


Phe 


Glu 


Phe 


Leu 


Ser 




1451 






275 










280 










285 










1453 


TCT 


AGC 


AGG 


GAC 


ACT 


TTA 


CAA 


AAA 


TAC 


TTC 


CAC 


CTT 


AAG 


TCT 


AAT 


ATT 


971 


1454 


Ser 


Ser 


Arg 


Asp 


Thr 


Leu 


Gin 


Lys 


Tyr 


Phe 


His 


Leu 


Lys 


Ser 


Asn 


He 




1455 




290 










295 










300 












1457 


GAC 


AAC 


GGG 


CAT 


TAC 


ATT 


GAC 


TTT 


TTA 


TAC 


ATG 


AAC 


GAT 


TTG 


GAC 


TAT 


1019 


1458 


Asp 


Asn 


Gly 


His 


Tyr 


He 


Asp 


Phe 


Leu 


Tyr 


Met 


Asn 


Asp 


Leu 


Asp 


Tyr 




1459 


305 










310 










315 










320 




1461 


GTG 


CGT 


TTT 


GAA 


AAG 


GTT 


AAT 


AAG 


CGT 


ATC 


ACA 


GAC 


GCC 


ACG 


CAC 


ATG 


1067 


1462 


Val 


Arg 


Phe 


Glu 


Lys 


Val 


Asn 


Lys 


Arg 


He 


Thr 


Asp 


Ala 


Thr 


His 


Met 




1463 










325 










330 










335 






1465 


TCT 


AGG 


GCG 


AAT 


TAC 


TAT 


TTG 


CAA 


ACA 


GAA 


AAC 


AAT 


TAT 


TAC 


GGC 


TTG 


1115 


1466 


Ser 


Arg 


Ala 


Asn 


Tyr 


Tyr 


Leu 


Gin 


Thr 


Glu 


Asn 


Asn 


Tyr 


Tyr 


Gly 


Leu 




1467 








340 










345 










350 








1469 


AAT 


ATC 


AAG 


TAT 


TTT 


TTA 


AAC 


CTG 


AAT 


AAA 


ATC 


AAC 


AAT 


AAC 


CGC 


ACT 


1163 


1470 


Asn 


He 


Lys 


Tyr 


Phe 


Leu 


Asn 


Leu 


Asn 


Lys 


He 


Asn 


Asn 


Asn 


Arg 


Thr 




1471 






355 










360 










365 










1473 


TTC 


CAA 


TCT 


GTC 


CCT 


AAT 


TTG 


CAA 


TAC 


CAT 


AAA 


TAT 


TTA 


AAT 


TCT 


TTG 


1211 


1474 


Phe 


Gin 


Ser 


Val 


Pro 


Asn 


Leu 


Gin 


Tyr 


His 


Lys 


Tyr 


Leu 


Asn 


Ser 


Leu 




1475 




370 










375 










380 












1477 


TAT 


TTT 


AGA 


AAT 


TTG 


TTG 


TAT 


TCG 


GTG 


GAT 


TAT 


CAG 


TTT 


AGA 


AAC 


ACC 


1259 


1478 


Tyr 


Phe 


Arg 


Asn 


Leu 


Leu 


Tyr 


Ser 


Val 


Asp 


Tyr 


Gin 


Phe 


Arg 


Asn 


Thr 




1479 


385 










390 










395 










400 




1481 


GCA 


AGA 


GAG 


ATT 


GGT 


TAT 


GGC 


TAT 


GTG 


CAA 


AAC 


GCT 


TTG 


AAT 


GTG 


CCG 


1307 


1482 


Ala 


Arg 


Glu 


He 


Gly 


Tyr 


Gly 


Tyr 


Val 


Gin 


Asn 


Ala 


Leu 


Asn 


Val 


Pro 




1483 










405 










410 










415 






1485 


GTG 


GGC 


TTG 


CAA 


TTT 


TCT 


TTG 


TTT 


AAA 


AAG 


TAT 


TTG 


TCT 


TTA 


GGG 


CTT 


1355 


1486 


Val 


Gly 


Leu 


Gin 


Phe 


Ser 


Leu 


Phe 


Lys 


Lys 


Tyr 


Leu 


Ser 


Leu 


Gly Leu 




1487 








420 










425 










430 








1489 


TGG 


AAT 


GAT 


CTC 


CAA 


CTA 


TCT 


AAT 


GTG 


GCT 


TTA 


ATG 


CAA 


TCT 


AAA 


AAT 


1403 


1490 


Trp 


Asn 


Asp 


Leu 


Gin 


Leu 


Ser 


Asn 


Val 


Ala 


Leu 


Met 


Gin 


Ser 


Lys 


Asn 




1491 






435 










440 










445 










1493 


TCC 


TTC 


GTG 


CCT 


ACG 


ATC 


CCT 


AAT 


GAA 


TCA 


AGG 


GAA 


TTT 


GGG 


AAT 


TTT 


1451 


1494 


Ser 


Phe 


Val 


Pro 


Thr 


He 


Pro 


Asn 


Glu 


Ser 


Arg 


Glu 


Phe 


Gly 


Asn 


Phe 




1495 




450 










455 










460 












1497 


GTG 


TCT 


TCA 


AAT 


TTT 


TCC 


ATG 


TAT 


GTC 


AAT 


ACG 


GAT 


TTG 


GCT 


AGA 


GAA 


1499 


1498 


Val 


Ser 


Ser 


Asn 


Phe 


Ser 


Met 


Tyr 


Val 


Asn 


Thr 


Asp 


Leu 


Ala 


Arg 


Glu 




1499 


465 










470 










475 










480 




1501 


TAC 


AAC 


AAG 


CTT 


TTC 


CAC 


ACG 


ATC 


CAA 


CTA 


GAA 


GCG 


ATT 


TTC 


AAC 


ATC 


1547 


1502 


Tyr 


Asn 


Lys 


Leu 


Phe 


His 


Thr 


He 


Gin 


Leu 


Glu 


Ala 


He 


Phe 


Asn 


He 




1503 










485 










490 










495 






1505 


CCT 


TAT 


TAC 


ACC 


TTT 


AAA 


AAC 


GGC 


TTA 


TTT 


TCT 


CAA 


AAC 


ATG 


TAT 


GCT 


1595 


1506 


Pro 


Tyr 


Tyr 


Thr 


Phe 


Lys 


Asn 


Gly 


Leu 


Phe 


Ser 


Gin 


Asn 


Met 


Tyr 


Ala 




1507 








500 










505 










510 








1509 


TTA 


AGC 


GCG 


CAA 


GCC 


TTA 


AAC 


AGC 


TAC 


ACT 


TCG 


CCT 


TTA 


TTG 


AGA 


GAT 


1643 



file://C:\CRF3\Outhold\VsrI88 1 752.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/881,752 



DATE: 07/05/2001 
TIME: 15:54:39 



Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 

1510 Leu Ser Ala Gin Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu Arg Asp 

1511 515 520 525 

1513 TAT GAT TAT CAA GGG CGT TTG TAT GAC TCG GTG TGG AAT CCT AGC AGT 1691 

1514 Tyr Asp Tyr Gin Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro Ser Ser 

1515 530 535 540 

1517 ATT TTA CCT AGC AAT GCG AGC AAC AAG ACG GTG GAT TTA ACC CTA ACG 173 9 

1518 lie Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr Leu Thr 

1519 545 550 555 560 

1521 CAA TAC CTT TAT GGC TTA GGG GGG CAA GAG TTA TTG TAT TTT AAA ATA ,1787 

1522 — . 



31N TYR LeU TYR GlY LeU GlY GlY GlN GlU LeU LeU TYR PHe LYS 
T524 



He 





TOG GAA CTC ATC AAT CTp GAC GAT AAA GTT TCG CCC TTT,AGA ATG CCA 
Ser Gin Leu He Asn Leu Asp Asp Lys Val Ser Pro Phe Arg Met Pro 

580 585 . 590 

CTA GAG AGC AAG ATC GGG TTT TCG CCC TTA ACG GGA TTG AAC ATC TTT 
Leu Glu Ser Lys He Gly Phe Ser Pro Leu Thr Gly Leu Asn He Phe 

595 600 605 

GGG AAT GTC TTT TAT TCG TTT TAT CAA AAC CGC TTA GAA GAA ATC TCT 
Gly Asn Val Phe Tyr Ser Phe Tyr Gin Asn Arg Leu Glu Glu He Ser 

610 615 620 

GTG AAC GCC AAT TAC CAA CGC AAG TTT TTA AGC TTT AAC CTC TCT TAT 
Val Asn Ala Asn Tyr Gin Arg Lys Phe Leu Ser Phe Asn Leu Ser Tyr 
625 630 635 640 

TTT TTA AAA AAC AAT TTT AGC AGT GGG ATT AAT AGC ATT GTA GAA AAT 
Phe Leu Lys Asn Asn Phe Ser Ser Gly He Asn Ser He Val Glu Asn 

645 650 655 

CTG CGG ATT ATT TAAAGGCGGG TTTTAGCAAC GACTTTGGCT ATTTTTCCAT GAGCGC 
Leu Arg He He * - 

660 

GGATGTGGGT TATGATATTA GAAA£AATGT GGTTTTAAAT TGGAATGTGG GGA0JTTATAA 
AAAAATCCGT TGTTTTGGGA TTGG 

(2) INFORMATION FOR SEQ ID NO: 111: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
Genomic DNA 



AAA ATA , 1787 . f> , c 



1835 



1883 



1931 



1979 



2027 



2085 



2145 
2169 



(ii) 
(ix) 



(xi) 



Coding Sequence 
53. . .352 



E--> 



MOLECULE TYPE: 
FEATURE : 

(A) NAME/KEY: 

(B) LOCATION: 
(D) OTHER INFORMATION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 111 
CTAATTCTGT CTATTACACC AAC^AJTCAAT CTCAAAACAA AGGACATJGAAAG ATG AAA 
* ' ' * * 4 Met Lys 

1 

ACA AAA CAT AAA GGA ATA AGA ATG TTT AAG CAA ATT CGT AGA ATG ATG 
Thr Lys His Lys Gly He Arg Met Phe Lys Gin He Arg Arg Met Met/ 

5 10 15 

AGT TTG GCA ATA TTA ATG CCT AGT TTT TTA TTG GCG GCA CCA GAT TAC 



58 



154 JfteJ*. JL^tf 




file://C:\CRF3\Outhold\VsrI881752.htm 
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RAW SEQUENCE LISTING DATE: 07/05/2001 

PATENT APPLICATION: US/09/881,752 TIME: 15:54:40 



Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 



8339 


Ser 


Leu 


Ala 


He 


Leu 


Met 


Pro 


Ser 


Phe 


Leu 


Leu 


Ala 


Ala 


Pro 


Asp 


Tyr 




8340 




20 










25 










30 












8342 


AAA 


CAA 


AAA 


TTC 


ACT 


CAA 


ATA 


TTG 


GAT 


TTC 


ATA 


AGC 


AAT 


GAC 


TTT 


ATC 


202 


8343 


Lys 


Gin 


Lys 


Phe 


Thr 


Gin 


He 


Leu 


Asp 


Phe 


He 


Ser 


Asn 


Asp 


Phe 


He 




8344 


35 










40 










45 










50 




8346 


AAG 


GCT 


ATT 


GGT 


GGT 


CTA 


ATC 


ATT 


GTT 


GGG 


ACT 


TGC 


ATT 


TAC 


GCC 


TAT 


250 


8347 


Lys 


Ala 


He 


Gly 


Gly 


Leu 


He 


He 


Val 


Gly 


Thr 


Cys 


He 


Tyr 


Ala 


Tyr 




8348 










55 










60 










65 






8350 


AAA 


AAT 


TGG 


GAC 


AGG 


CTT 


GGA 


GAA 


ATT 


GGT 


TGG 


AAA 


TGC 


GTT 


GGG 


ATT 


298 


8351 


Lys 


Asn 


Trp 


Asp 


Arg 


Leu 


Gly 


Glu 


He 


Gly 


Trp 


Lys 


Cys 


Val 


Gly 


He 




8352 








70 










75 










80 








8354 


ATC 


ATT 


ATA 


ACC 


GCT 


GCT 


ATT 


TCT 


AAT 


GCT 


AAA 


ACT 


TTA 


AGT 


CAA 


TGG 


346 


8355 


He 


He 


He 


Thr 


Ala 


Ala 


He 


Ser 


Asn 


Ala 


Lys 


Thr 


Leu 


Ser 


Gin 


Trp 




8356 






85 










90 










95 











8358 TTA TTT TAGATGGCAT TGCATATTGT TTGTGTTGAA AGTATCAACA TTAGA 397 

8359 Leu Phe 

8360 100 

10141 (2) INFORMATION FOR SEQ ID NO: 138: 
1014 3 (i) SEQUENCE CHARACTERISTICS: 

10144 (A) LENGTH: 1290 amino acids 

10145 (B) TYPE: amino acid 

10146 (C) STRANDEDNESS : single 

10147 (D) TOPOLOGY: linear 

10149 (ii) MOLECULE TYPE: protein 

10150 (v) FRAGMENT TYPE: internal 
10152 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



10154 


Met 


Glu 


He 


Gin 


Gin 


Thr 


His 


Arg 


Lys 


He Asn 


Arg 


Pro 


Leu 


Val 


Ser 


10155 


1 








5 










10 








15 




10156 


Leu 


Ala 


Leu 


Val 


Gly 


Ala 


Leu 


Val 


Ser 


He Thr 


Pro 


Gin 


Gin 


Ser 


His 


10157 








20 










25 








30 






10158 


Ala 


Ala 


Phe 


Phe 


Thr 


Thr 


Val 


He 


He 


Pro Ala 


He 


Val 


Gly 


Gly 


He 


10159 






35 










40 








45 








10160 


Ala 


Thr 


Gly 


Ala 


Ala 


Val 


Gly 


Thr 


Val 


Ser Gly 


Leu 


Leu 


Gly 


Trp 


Gly 


10161 




50 










55 








60 










10162 


Leu 


Lys 


Gin 


Ala 


Glu 


Glu 


Ala 


Asn 


Lys 


Thr Pro 


Asp 


Lys 


Pro 


Asp 


Lys 


10163 


65 










70 








75 










80 


10164 


Val 


Trp 


Arg 


He 


Gin 


Ala 


Gly 


Lys 


Gly 


Phe Asn 


Glu 


Phe 


Pro 


Asn 


Lys 


10165 










85 










90 








95 




10166 


Glu 


Tyr 


Asp 


Leu 


Tyr 


Arg 


Ser 


Leu 


Leu 


Ser Ser 


Lys 


He 


Asp 


Gly Gly 


10167 








100 










105 








110 






10168 


Trp 


Asp 


Trp 


Gly 


Asn 


Ala 


Ala 


Thr 


His 


Tyr Trp 


Val 


Lys 


Gly 


Gly Gin 


10169 






115 










120 








125 








10170 


Trp Asn 


Lys 


Leu 


Glu 


Val 


Asp 


Met 


Lys 


Asp Ala 


Val 


Gly 


Thr 


Tyr 


Asn 


10171 




130 










135 








140 










10172 


Leu 


Ser 


Gly 


Leu 


Arg 


Asn 


Phe 


Thr 


Gly 


Gly Asp 


Leu 


Asp 


Val 


Asn 


Met 


10173 


145 










150 








155 










160 


10174 


Gin 


Lys 


Ala 


Thr 


Leu 


Arg 


Leu 


Gly 


Gin 


Phe Asn 


Gly 


Asn 


Ser 


Phe 


Thr 


10175 










165 










170 








175 




10176 


Ser 


Tyr 


Lys 


Asp 


Ser 


Ala 


Asp 


Arg 


Thr 


Thr Arg 


Val 


Asp 


Phe 


Asn 


Ala 
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10177 








180 










185 










190 






10178 


Lys 


Asn 


He 


Leu 


He 


Asp 


Asn 


Phe 


Leu 


Glu 


He 


Asn 


Asn 


Arg 


Val 


Gly 


10179 






195 










200 










205 








10180 


Ser Gly Ala 


Gly 


Arg 


Lys 


Ala 


Ser 


Ser 


Thr 


Val 


Leu 


Thr 


Leu 


Gin 


Ala 


10181 




210 










215 










220 










10182 


Ser 


Glu 


Gly 


He 


Thr 


Ser 


Ser 


Lys 


Asn 


Ala 


Glu 


He 


Ser 


Leu 


Tyr 


Asp 


10183 


225 










230 










235 










240 


10184 


Gly Ala 


Thr 


Leu 


Asn 


Leu 


Ala 


Ser 


Asn 


Ser 


Val 


Lys 


Leu 


Met 


Gly 


Asn 


10185 










245 










250 










nee 

255 




10186 


Val 


Trp 


Met 


Gly 


Arg 


Leu 


Gin 


Tyr Val 


Gly 


Ala 


Tyr 


Leu 


Ala 


Pro 


Ser 


10187 








260 










265 










270 






10188 


Tyr 


Ser 


Thr 


He 


"Asn 


Thr 


Ser 


Lys 


Val 


Thr 


Gly 


Glu 


Val 


Asn 


Phe 


Asn 


10189 






275 










280 










285 








10190 


His 


Leu 


Thr 


Val 


Gly Asp 


His 


Asn 


Ala 


Ala 


Gin 


Ala 


Gly 


He 


lie 


Ala 


10191 




290 










295 










300 










10192 


Ser 


Asn 


Lys 


Thr 


His 


He 


Gly 


Thr 


Leu 


Asp 


Leu 


Trp 


Gin 


Ser 


Ala 


Gly 


10193 


305 










310 










315 










320 


10194 


Leu 


Asn 


He 


He 


Ala 


Pro 


Pro 


Glu Gly Gly 


Tyr 


Lys 


Asp 


Lys 


Pro 


Lys 


10195 










325 










330 










335 




10196 


Asp 


Lys 


Pro 


Ser 


Asn 


Thr 


Thr 


Gin 


Asn 


Asn 


Ala 


Asn 


Asn 


Asn 


Gin 


Gin 


10197 








340 










345 










350 






10198 


Asn 


Ser 


Ala 


Gin 


Asn 


Asn 


Ser 


Asn 


Thr 


Gin 


Val 


He 


Asn 


Pro 


Pro 


Asn 


10199 






355 










360 










365 








10200 


Ser 


Ala 


Gin 


Lys 


Thr 


Glu 


He 


Gin 


Pro 


Thr 


Gin 


Val 


lie 


Asp 


Gly 


Pro 


10201 




370 










375 










380 










10202 


Phe Ala Gly 


Gly 


Lys 


Asp 


Thr 


Val 


Val 


Asn 


lie 


Asp 


Arg 


He 


Asn 


Thr 


10203 


385 










390 










395 










400 


10204 


Asn 


Ala 


Asp 


Gly 


Thr 


He 


Lys 


Val Gly Gly 


Tyr 


Lys 


Ala 


Ser 


Leu 


Thr 


10205 










405 










410 










415 




10206 


Thr 


Asn 


Ala 


Ala 


His 


Leu 


His 


He Gly 


Lys 


Gly 


Gly 


He 


Asn 


Leu 


Ser 


10207 








420 










425 










430 






10208 


Asn 


Gin 


Ala 


Ser 


Gly Arg 


Thr 


Leu 


Leu 


Val 


Glu 


Asn 


Leu 


Thr 


Gly 


Asn 


10209 






435 










440 










445 








10210 


He 


Thr 


Val 


Asp 


Gly 


Pro 


Leu 


Arg 


Val 


Asn 


Asn 


Gin 


Val 


Gly 


Gly 


Tyr 


10211 




450 










455 










460 










10212 


Ala 


Leu 


Ala 


Gly 


Ser 


Ser 


Ala 


Asn 


Phe 


Glu 


Phe 


Lys 


Ala 


Gly 


Thr 


Asp 


10213 


465 










470 










475 










480 


10214 


Thr 


Lys 


Asn 


Gly 


Thr 


Ala 


Thi* 


Phe 


Asn 


Asn 


Asp 


lie 


Ser 


Leu 


Gly 


Arg 


10215 










485 










490 










495 




10216 


Phe 


Val 


Asn 


Leu 


Lys 


Val 


Asp 


Ala 


His 


Thr 


Ala 


Asn 


Phe 


Lys 


Gly 


He 


10217 








500 










505 










510 






10218 


Asp Thr Gly 


Asn 


Gly Gly Phe 


Asn 


Thr 


Leu 


Asp 


Phe 


Ser 


Gly 


Val 


Thr 


10219 






515 










520 










525 








10220 


Gly Lys Val 


Asn 


He 


Asn 


Lys 


Leu 


lie 


Thr 


Ala 


Ser 


Thr 


Asn 


Val 


Ala 


10221 




530 










535 










540 










10222 


Val 


Lys 


Asn 


Phe 


Asn 


He 


Asn 


Glu 


Leu 


Val 


Val 


Lys 


Thr 


Asn 


Gly 


Val 


10223 


545 










550 










555 










560 


10224 


Ser 


Val 


Gly 


Glu 


Tyr 


Thr 


His 


Phe 


Ser 


Glu 


Asp 


He 


Gly 


Ser 


Gin 


Ser 


10225 










565 










570 










575 
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10226 


Arg 


He 


Asn 


Thr 


Val 


Arg 


Leu 


Glu 


Thr 


Gly 


Thr 


Arg 


Ser 


He 


Phe 


Ser 


10227 








580 










585 










590 






10228 


Gly 


Gly Val 


Lys 


Phe 


Lys 


Ser 


Gly 


Glu 


Lys 


Leu 


Val 


He 


Asp 


Glu 


Phe 


10229 






595 










600 










605 








10230 


Tyr 


Tyr 


Ser 


Pro 


Trp 


Asn 


Tyr 


Phe 


Asp 


Ala 


Arg 


Asn 


He 


Lys 


Asn 


Val 


10231 




610 










615 










620 










10232 


Glu 


He 


Thr 


Arg 


Lys 


Phe 


Ala 


Ser 


Ser 


Thr 


Pro 


Glu 


Asn 


Pro 


Trp 


Gly 


10233 


625 










630 










635 










640 


10234 


Thr 


Ser 


Lys 


Leu 


Met 


Phe 


Asn 


Asn 


Leu 


Thr 


Leu 


Gly 


Gin 


Asn 


Ala 


Val 


10235 










645 










650 










655 




10236 


Met 


Asp 


Tyr 


Ser 


Gin 


Phe 


Ser 


Asn 


Leu 


Thr 


He 


Gin 


Gly 


Asp 


Phe 


He 


10237 








660 










665 










670 






10238 


Asn 


Asn 


Gin 


Gly Thr 


He 


Asn 


Tyr 


Leu 


Val 


Arg 


Gly 


Gly 


Gin 


Val 


Ala 


10239 






675 










680 










685 








10240 


Thr 


Leu 


Asn 


Val 


Gly 


Asn 


Ala 


Ala 


Ala 


Met 


Phe 


Phe 


Ser 


Asn 


Asn 


Val 


10241 




690 










695 










700 










10242 


Asp 


Ser 


Ala 


Thr Gly 


Phe 


Tyr 


Gin 


Pro 


Leu 


Met 


Lys 


He 


Asn 


Ser 


Ala 


10243 


705 










710 










715 










720 


10244 


Gin 


Asp 


Leu 


He 


Lys 


Asn 


Lys 


Glu 


His 


Val 


Leu 


Leu 


Lys 


Ala 


Lys 


He 


10245 










725 










730 










735 




10246 


He 


Gly 


Tyr 


Gly Asn 


Val 


Ser 


Leu 


Gly 


Thr 


Asn 


Ser 


He 


Ser 


Asn 


Val 


10247 








740 










745 










750 






10248 


Asn 


Leu 


He 


Glu 


Gin 


Phe 


Lys 


Glu 


Arg 


Leu 


Ala 


Leu 


Tyr 


Asn 


Asn 


Asn 


10249 






755 










760 










765 








10250 


Asn 


Arg 


Met 


Asp 


He 


Cys 


Val 


Val 


Arg 


Asn 


Thr 


Asp 


Asp 


He 


Lys 


Ala 


10251 




770 










775 










780 










10252 


Cys 


Gly 


Thr 


Ala 


He 


Gly 


Asn 


Gin 


Ser 


Met 


Val 


Asn 


Asn 


Pro 


Asp 


Asn 


10253 


785 










790 










795 










800 


10254 


Tyr 


Lys 


Tyr 


Leu 


He 


Gly 


Lys 


Ala 


Trp 


Lys 


Asn 


He 


Gly 


He 


Ser 


Lys 


10255 










805 










810 










815 




10256 


Thr 


Ala 


Asn 


Gly Ser 


Lys 


He 


Ser 


Val 


Tyr 


Tyr 


Leu 


Gly 


Asn 


Ser 


Thr 


10257 








820 










825 










830 






10258 


Pro 


Thr 


Glu 


Lys Gly 


Gly 


Asn 


Thr 


Thr 


Asn 


Leu 


Pro 


Thr 


Asn 


Thr 


Thr 


10259 






835 










840 










845 








10260 


Ser 


Asn 


Val 


Arg 


Ser 


Ala 


Asn 


Asn 


Ala 


Leu 


Ala 


Gin 


Asn 


Ala 


Pro 


Phe 


10261 




850 










855 










860 










10262 


Ala 


Gin 


Pro 


Ser 


Ala 


Thr 


Pro 


Asn 


Leu 


Val 


Ala 


He 


Asn 


Gin 


His 


Asp 


10263 


865 










870 










875 










880 


10264 


Phe 


Gly 


Thr 


He 


Glu 


Ser 


Val 


Phe 


Glu 


Leu 


Ala 


Asn 


Arg 


Ser 


Lys 


Asp 


10265 










885 










890 










895 




10266 


He 


Asp 


Thr 


Leu 


Tyr 


Ala 


Asn 


Ser 


Gly 


Ala 


Gin 


Gly 


Arg 


Asp 


Leu 


Leu 


10267 








900 










905 










910 






10268 


Gin 


Thr 


Leu 


Leu 


He 


Asp 


Ser 


His 


Asp 


Ala 


Gly 


Tyr 


Ala 


Arg 


Gin 


Met 


10269 






915 










920 










925 








10270 


He 


Asp 


Asn 


Thr 


Ser 


Thr 


Gly 


Glu 


He 


Thr 


Lys 


Gin 


Leu 


Asn 


Ala 


Ala 


10271 




930 










935 










940 










10272 


Thr 


Thr 


Thr 


Leu 


Asn 


Asn 


He 


Ala 


Ser 


Leu 


Glu 


His 


Lys 


Thr 


Ser 


Ser 


10273 


945 










950 










955 










960 


10274 


Leu 


Gin 


Thr 


Leu 


Ser 


Leu 


Ser 


Asn 


Ala 


Met 


He 


Leu 


Asn 


Ser 


Arg 


Leu 
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10275 
10276 
10277 
10278 
10279 
10280 
10281 
10282 

E--> 10283 
10284 
10285 
10286 
10287 
10288 
10289 
10290 
10291 
10292 

E--> 10293 
10294 
10295 
10296 
10297 
10298 
10299 
10300 
10301 
10302 

E--> 10303 
10304 
10305 
10306 
10307 
10308 
10309 
10310 
10311 
10312 

E--> 10313 
10314 
10315 
11673 
11675 
11676 
11677 
11678 
11679 
11681 
11682 



4 
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965 970 975 

Val Asn Leu Ser Arg Arg His Thr Asn Asn lie Asp Ser Phe Ala Gin 

980 - 985 990 

Arg Leu Gin Ala Leu Lys Asp Gin Lys Phe Ala Ser Leu Glu Ser Ala 

.995 1000 1005 

Ala Glu Val Leu Tyr Gin Phe Ala Pro Lys Tyr Glu Lys Pro Thr Asn 

1010 1015 1020 

Val Trp Ala Asn Ala lie Gly Gly Thr Ser Leu Asn Asn Gly Gly Asn 
<^Q25)lDZS 1030 1035 1040 

Ala Ser Leu Tyr Gly Thr Ser Ala Gly Val Asp Ala Tyr Leu Asn Gly 

1045 1050 1055 

Glu Val Glu Ala lie Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser 

1060 1065 1070 

Phe Asn Asn Gin Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Thr Asn 

1075 1080 1085 

Phe Gly Val Tyr Ser Arg lie Phe Ala Asn Gin His Glu Phe Asp Phe 

1090 1095 1100 

Gljjr^Ala Gin Gly Ala Leu Gly Ser Asp Gin Ser Ser Leu Asn Phe Lys 
/TOW 1110 1115 1120 

^e*rAla Leu Leu Arg Asp Leu Asn Gin Ser Tyr Asn Tyr Leu Ala Tyr 
1125 1130 1135 . 

Ser Ala Ala Thr Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg 

1140 1145 1150 

Asn Ala Leu Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu 

1155 1160 1165 

Gly Ser Thr Asn Phe Lys Ser Asn Ser Asn Gin Val Ala Leu Lys Asn 

1170 1175 1180 

J^LY^Ser Ser Ser Gin His Leu Phe Asn Ala Ser Ala Asn Val Glu Ala 
(^18^/ 1190 1195 1200 

Arg Tyr Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Met Asn Ala Gly Val 

1205 1210 1215 

Leu Gin Glu Phe Ala Asn Phe Gly Ser Ser Asn Ala Val Ser Leu Asn 

1220 1225 1230 

Thr Phe Lys Val Asn Ala Ala His Asn Pro Leu Ser Thr His Ala Arg 

1235 1240 1245 

Val Met Met Gly Gly Glu Leu Lys Leu Ala Lys Glu Val Phe Leu Asn 
1250 1255 1260 

>*£reu\Gly Phe Val Tyr Leu His Asn Leu lie Ser Asn lie Gly His Phe 
\2§y 1270 1275 1280 

Ala Ser Asn Leu Gly Met Arg Tyr Ser Phe 
1285 1290 
(2) INFORMATION FOR SEQ ID NO: 150: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



(D) TOPOLOGY 
(ii) MOLECULE TYPE 
(V) FRAGMENT TYPE 



linear 

protein 

internal 



/ft 



Or] c, 

■ — " 
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11684 
11686 
11687 
11688 
11689 
11690 
11691 
11692 
11693 
11694 
11695 
11696 
11697 
11698 
11699 
11700 
11701 
11702 
11703 
11704 
11705 
11706 
11707 
11708 
11709 
11710 
11711 
11712 
11713 
11714 
11715 
11716 
11717 
11718 
11719 
11720 
11721 
11722 
11723 
11724 
11725 
11726 
11727 
11728 
11729 
11730 
11731 
11732 
11733 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
Met lie Lys Lys Ala Arg Lys Phe lie Pro Phe Phe Leu lie Gly Ser 

1_ 5 10 15 

Leu Leu Ala Glu Asp Asn Gly Trp Tyr Met Ser Val Gly Tyr Gin lie 

20 25 30 

Gly Gly Thr Gin Gin Phe lie Asn Asn Lys Gin Leu Leu Glu Asn Gin 

35 40 45 

Asn lie lie Asn Ser Val Thr Gin Ser Ala lie Asn lie Ala Gly Pro 

50 55 60 

Thr Thr Gly Leu lie Thr Leu Ser Ser Gin Thr Val lie Asp Ala Leu 
65 70 75 80 

Gly Tyr Gly Val Ser Asn Thr Val Gly Asn Gin Leu Glu Gly lie Ser 

85 90 95 

Asn lie Leu Asn Gin lie Gly Lys Arg Lys Asp Phe Tyr Ser Ser Arg 

100 105 110 

Gin lie Ser Ser lie Ser Gin Gin lie lie Gly Leu Lys Gly Ser Ser 

115 120 125 

Asp Pro Leu Lys Ala His Ser Ser Gin lie Thr Ala Lys Leu Leu Ser 

130 135 140 

Asn Thr Gin Ser Ala Phe Asp Gin Gly lie Ala Leu Ser Thr Asn lie 
145 150 155 160 

lie Ser Ser lie Asn Ser Leu Asn Pro Ser Asn Asn Thr Gin Glu Val 

165 170 175 

Lys Lys Gin Leu Gin Asn Thr Ala Gin Ser Met Thr Glu Leu Leu Gin 

180 185 190 

Gin lie Glu His Ser lie Thr Lys Thr Thr Ser Thr Thr Tyr Ala Gin 

195 200 205 

Ser Leu Leu Ser Asn Leu Thr Asp Ala Val Asn Ala Ser Ser Ash Asn 

210 215 220 

Thr Ala Tyr Val Ser Ala Leu Val Asn Ala Leu Asn Thr Leu Gly Val 
225 230 235 240 

Gly Val Phe Pro Thr Thr Thr Thr Thr His Val Val Leu Asn Pro Pro 

245 250 255 

Gly Gin Val Val Phe Tyr Pro Thr Asn Ser lie Leu Gly Ser Thr Ser 

260 265 270 

Ser Asn Ser Asn Asn Gin Gin Gin Tyr Asn Asn Thr Leu Leu Met Asn 

275 280 285 

Thr Leu Gin Gly Thr Leu Ser Ala Asn Thr Gin Asn Asn Pro Asn Gly 

290 295 300 

Cys Ala Asn Gin Val Gin Cys Leu Glu Gin Phe lie Gin Asn Leu Ala 
305 310 315 320 

Pro Leu Ala Ala Thr Pro Thr Ser Asn Asn Gin Ala Asn Gin Gin Val 

325 330 335 

Gin Ala lie Ala Gin Lys Leu Gin Ser Val Ala lie Asn Thr Leu Asp 

340 345 350 

Asn Asn Ala lie Asn Asn Thr Thr Tyr Asn Leu Asn Asn Leu His Asn 

355 360 365 

Ala Leu Asn Phe Gin Ala Tyr Glu Ser Thr lie Glu Gin Tyr Asn Asn 
370 375 380 
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11734 Ala Leu Lys Gin lie Ser Trp He Ser Phe Thr Glu Pro Lys Asn Leu 

11735 385 390 395 * 400 

11736 Leu Lys. Asn Thr Ser Asn Asn Tyr Gin He Gly Thr Val Thr Asn Ala 

11737 405 410 415 

11738 Gin Gly Gin Asn He Ser Ala Tyr Asp Cys Met Thr Ala Thr Gly Ser 

11739 420 425 430 

11740 Leu Ser Ser Asn Ala Ser Ser Gly He Ser Cys Ser Ala Thr Ser Ser 

11741 435 440 445 

11742 Thr Ser Ser Thr Asn Ser Phe Asp Asn Ser Leu Val Ala Thr Ser Lys 

11743 450 455 460 

11744 Val Gin Thr He Asn Gly Lys Glu Gin He Gly Val Asn Ser Phe Asn 

11745 465 470 475 480 

11746 Leu Val Ser Gin Val Trp Ser Val Tyr Asn Ser Leu Lys Thr Ser Glu 

11747 485 490 495 

11748 Glu Asn Leu Gin Lys Asn Ala Asn He Leu Cys Ala Asn Gly Thr Gin 

11749 500 505 510 

11750 Ser Gly Thr Ser Ser Cys Asn Ser Ser Ser Gly Gly Leu Ser He Ser 

11751 515 520 525 

11752 Gly Asn Ala Gin Leu Gin Asn He Leu Ser Pro Thr Ser Gly Thr Thr 

11753 530 535 540 

11754 Thr Asn Thr Gin Ala Lys Ser Asn Ala Pro Lys Leu Lys Ala Met Val 
H755 545 550 555 560 
11756 Val Val Asn Asn Glu Glu Glu Ala Lys Thr Ala Asn Leu Ala Gin Ser 
H757 565 570 575 

11758 Ser Gly Thr Thr Thr Gin Ser Pro Asn Ser Thr Val Met Gly Ala Leu 

11759 580 585 590 

11760 Asn Thr Val Leu Gin Asn Val Ser Asn Phe Gin Gin Ser He Gin Asn 
H761 595 600 605 

11762 Ala Phe Gin Asn Gin Glu Ser Asn He Gin Ala Trp Ala Asn Ala He 

11763 610 615 620 

11764 Tyr Asn Thr Asn Gly Ser Gin Ser Gin Glu Met Thr Pro Asn Asn Asn 

11765 625 630 635 640 

11766 Gin Asp Leu Arg He Gin Leu Arg Ala Asn Phe Tyr Gin Leu lie Asn 
H767 645 650 655 . 

11768 Thr He Asn Gin Gin Val Pro Thr Asp Met Asn Ala Leu He Asn Gin 

11769 660 665 670 

11770 Ser Gin Gin Thr Gin Gin Thr Ser Gly Ser Ala Ser Asn Asn Asn Ala 
H771 675 680 685 

11772 Cys Ala Ser Gly Met Ser Gly Ser Asn Gly Asn Trp Cys Tyr Gin Gin 

11773 690 695 700 

11774 Trp Ser Asp Ser Lys Ala Tyr Tyr Ser Gly Leu Gin Ser Ala Leu Gly 
H775 705 710 715 720 
11776 Tyr Gin Thr Gin Ala Thr Thr Gin Ser Gly Ser Asn Gly Gly Asn Ser 
H777 725 730 735 
11778 He Thr Tyr Asn Val Gin Gin He Thr Leu Thr Ser Asn Gly Leu Leu 
H779 740 745 750 

11780 Asn Gin He He Thr Asn Leu Lys Ser Val Asn Gly Gly Asn Gly Ala 

H781 755 760 765 

11782 Ser Gly Thr Gly Ser Gly Asn Gly Thr Ser Gin He Asn Thr Ala Tyr 
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11783 
11784 
11785 
11786 
11787 
11788 
11789 
11790 
11791 
11792 
11793 
11794 
11795 
11796 
11797 
11798 
11799 
11800 
11801 
11802 
11803 
11804 
11805 
11806 
11807 
11808 
11809 
11810 
11811 
11812 
11813 
11814 
E--> 11815 
11816 
11817 
11818 
11819 
11820 
11821 
11822 
11823 
11824^ 
E--> 1182! 
11826' 
11827 
11828 
11829 
11830 
11831 



Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 

770 775 780 

Gin Met Leu Thr Asp Ala Ser Asp Gly Lys Leu Gly Thr Tyr Ser Ser 
785 790 795 800 

Ser Ser Gly Ser Asn Asn Gly Tyr Thr Pro Cys Asn Ser Thr Asn Gly 

805 810 815 

Ser Asn Lys Thr Ser Gly Asn Asn Cys Tyr Glu Pro Asn Lys Gin Gin 

820 825 830 

Asn Ala Thr Thr Ala Thr Ala Thr Thr Asp Ser Asn Leu Gin Lys Val 

835 840 845 

Tyr Asn Asp Ala Gin Lys He Ala Asn He He Ala Ser Ser Gly Asn 

850 855 860 

Asn Lys Gly Val Glu Asn Gly Leu Lys Gin Phe Phe Glu Ala Leu Lys 
865 870 875 880 

Asn Asn Ser Ser Ser Leu Ser Asn Leu Cys Gly Asn Gly Ser Ser Gly 

885 890 895 

Ser Ser Gly Thr Thr Cys Ser Gly Trp Leu He Asn Leu Leu Gly Ala 

900 905 910 

He Pro Thr Asn Gly Val Ser Asp Thr Asn Asn Leu He Asn Leu Leu 

915 920 925 

Thr Glu Phe He Lys Thr Ala Gly Phe He Gin Asn Asn Asp Ser Ser 

930 935 940 

Val Ser Thr Ser Leu Thr Ser Ala Phe Gin Ala He Thr Ser Ala He 
945 950 . 955 960 

Ser Gin Gly Phe Gin Ala Leu Gin Asn Asp He Ser Pro Asn Ala He 

965 970 975 

Leu Thr Leu Leu Gin Glu He Thr Ser Asn Thr Thr Thr He Gin Ser 

980 985 990 

Phe Ser Gin Thr Leu Arg Gin Leu Leu Gly Asp Lys Thr Phe Phe Met 

995 1000 1005 

Ala Gin Gin Lys Leu lie Asp Ala Met He Asn Ala Arg Asn Gin Val 
1010 1015 1020 

pG ^A sn Ala Gln Asn Gln Ala Asn Asn T ^ r Ser Gln Pro Val Leu 

P25^y 1030 1035 1040 

^er Gln Tyr Ala Ala Ala Lys Ser Thr Gln His Gly Met Ser Asn Gly 

1045 1050 1055 

Leu Gly Val Gly Leu Gly Tyr Lys Tyr Phe Phe Gly Lys Ala Arg Lys 

1060 1065 1070 

Leu Gly Leu Arg His Tyr Phe Phe Phe Asp Tyr Gly Phe Ser Glu He 

1075 1080 1085 

Gly Leu Ala Asn Gln Ser Val Lys Ala Asn He Phe Ala Tyr Gly Val 
1090 1095 1100 

Thr Asp Phe Leu Trp Asn Leu Phe Arg Arg Thr Tyr Asn Thr Lys 
105/ 1110 1115 H20 

a Leu Asn Phe Gly Leu Phe Ala Gly Val Gln Leu Gly Gly Ala Thr 
1125 1130 1135 

Trp Leu Ser Ser Leu Arg Gln Gln He He Asp Asn Trp Gly Ser Ala 

1140 1145 1150 

Asn Asp He His Ser Thr Asn Phe Gln Val Ala Leu Asn Phe Gly Val 
1155 1160 1165 
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11832 
11833 
11834 
► 11835 
11836 
11837 
11838 
11839 
23321 
23323 
23324 
23325 
23326 
23327 
23329 
23330 
23332 
23334 
23335 
23336 
23337 
23338 
23339 
23340 
23341 
23342 
23343 
23344 
23345 
23346 
23347 
23348 
23349 
23350 
23351 
23352 
23353 
23354 
23355 
23356 
23357 
23358 
23359 
23360 
23361 
23362 
23363 
23364 
23365 



Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 

Arg Thr Asn Phe Ala Glu Phe Lys Arg Phe Ala Lys Lys Phe His Asn 

1170 1175 H80 

5lri\Gly Val He Ser Gin Lys Ser Val Glu Phe Gly He Lys Val Pro 
l&V H90 H95 120 o 

Hi He Asn Gin Ala Tyr Leu Asn Ser Ala Gly Ala Asp Val Ser Tyr 
1205 1210 1215 

Arg Arg Leu Tyr Thr Phe Tyr He Asn Tyr He Met Gly Phe 



1230 



1220 1225 
(2) INFORMATION FOR SEQ ID NO: 308: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1797 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 
Met Ala Gly Arg His Gly Asn Lys Gly He Val Ser Asn He Val Pro 

1 5 io 15 

Val Ala Asp Met Pro Tyr Thr Ala Asp Gly Glu Pro Val Asp He Val 

20 25 30 

Leu Asn Pro Leu Gly Val Pro Ser Arg Met Asn He Gly Gin He Leu 

35 40 45 

Glu Met His Leu Gly Leu Val Gly Lys Glu Phe Gly Lys Gin -.-lie Ala 

50 55 60 

Arg Met Leu Glu Asp Lys Thr Lys Asp Phe Ala Lys Glu Leu Arg Ala 
65 70 75 80 

Lys Met Leu Glu Xaa Ala Asn Ala He Asn Glu Lys Asp Pro Leu Thr 

85 90 95 

He His Ala Leu Glu Asn Cys Ser Asp Glu Glu Leu Leu Glu Tyr Ala 

100 105 no 

Lys Asp Trp Ser Lys Gly Val Lys Met Ala He Pro Val Phe Glu Gly 

115 120 125 

He Ser Gin Glu Lys Phe Tyr Lys Leu Phe Glu Leu Ala Lys He Ala 

130 135 140 

Met Asp Gly Lys Met Asp Leu Tyr Asp Gly Arg Thr Gly Glu Lys Met 
i45 150 155 160 

Arg Glu Arg Val Asn Val Gly Tyr Met Tyr Met He Lys Leu His His 

165 170 175 

Leu Val Asp Glu Lys Val His Ala Arg Ser Thr Gly Pro Tyr Ser Leu 

180 185 190 

Val Thr His Gin Pro Val Gly Gly Lys Ala Leu Phe Gly Gly Gin Arg 

195 200 205 

Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala Ala His 

210 215 220 

Thr Leu Lys Glu Met Leu Thr He Lys Ser Asp Asp He Arg Gly Arg 
225 230 235 240 

Glu Asn Ala Tyr Arg Ala He Ala Lys Gly Glu Gin Val Gly Glu Ser 
245 250 255 
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23366 Glu He Pro Glu Thr Phe Tyr Val Leu Thr Lys Glu Leu Gin Ser Leu 

23367 260 265 270 

23368 Ala Leu Asp He Asn He Phe Gly Asp Asp Val Asp Glu Asp Gly Ala 

23369 275 280 285 

23370 Pro Lys Pro He Val He Lys Glu Asp Asp Arg Pro Lys Asp Phe Ser 

23371 290 295 300 

23372 Ser Phe Gin Leu Thr Leu Ala Ser Pro Glu Lys He His Ser Trp Ser 

23373 305 310 315 320 

23374 Tyr Gly Glu Val Lys Lys Pro Glu Thr He Asn Tyr Arg Thr Leu Lys 

23375 325 330 335 

23376 Pro Glu Arg Asp Gly Leu Phe Cys Met Lys He Phe Gly Pro Thr Lys 

23377 340 345 350 

23378 Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Lys Pro Arg Phe Lys Asp 

23379 355 360 . 365 

23380 He Gly Thr Cys Glu Lys Cys Gly Val Ala He Thr His Ser Lys Val 

23381 370 375 380 

23382 Arg Arg Phe Arg Met Gly His He Glu Leu Ala Thr Pro Val Ala His 

23383 385 390 395 400 

23384 He Trp Tyr Val Asn Ser Leu Pro Ser Arg lie Gly Thr Leu Leu Gly 

23385 405 410 415 

23386 Val Lys Met Lys Asp Leu Glu Arg Val Leu Tyr Tyr Glu Ala Tyr He 

23387 420 425 430 

23388 Val Lys Glu Pro Gly Glu Ala Ala Tyr Asp Asn Glu Gly Thr Lys Leu 

23389 435 440 445 

23390 Val Met Lys Tyr Asp He Leu Asn Glu Glu Gin Tyr Gin Asn He Ser 

23391 450 455 460 

23392 Arg Arg Tyr Glu Asp Arg Gly Phe Val Ala Gin Met Gly Gly Glu Ala 

23393 465 470 475 480 

23394 He Lys Asp Leu Leu Glu Glu He Asp Leu He Thr Leu Leu Gin Ser 

23395 485 490 495 

23396 Leu Lys Glu Glu Val Lys Asp Thr Asn Ser Asp Ala Lys Lys Lys Lvs 

23397 500 505 ' 510 

23398 Leu He Lys Arg Leu Lys Val Val Glu Ser Phe Leu Asn Ser Gly Asn 

23399 515 520 525 

23400 Arg Pro Glu Trp Met Met Leu Thr Val Leu Pro Val Leu Pro Pro Asp 
.23401 530 535 540 

23402 Leu Arg Pro Leu Val Ala Leu Asp Gly Gly Lys Phe Ala Val Ser Asp 

23403 545 550 555 560 

23404 Val Asn Glu Leu Tyr Arg Arg Val He Asn Arg Asn Gin Arg Leu Lys 

23405 565 570 575 

23406 Arg Leu Met Glu Leu Gly Ala Pro Glu He He Val Arg Asn Glu Lys 

23407 580 585 590 

23408 Arg Met Leu Gin Glu Ala Val Asp Val Leu Phe Asp Asn Gly Arg Ser 

23409 595 600 605 

23410 Thr Asn Ala Val Lys Gly Ala Asn Lys Arg Pro Leu Lys Ser Leu Ser 

23411 610 615 620 

23412 Glu He He Lys Gly Lys Gin Gly Arg Phe Arg Gin Asn Leu Leu Glv 

23413 625 630 635 640 

23414 Lys Arg Val Asp Phe Ser Gly Arg Ser Val He Val Val Gly Pro Asn 
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23415 



645 



650 



655 



23416 


Leu 


Lys 


Met 


Asp 


Glu 


Cys 


Gly 


Leu 


Pro 


Lys Asn 


Met 


Ala 


Leu 


Glu 


Leu 


23417 








660 










665 








670 






23418 


Phe 


Lys 


Pro 


His 


Leu 


Leu 


Ser 


Lys 


Leu 


Glu Glu Arg Gly Tyr Ala 


Thr 


23419 






675 










680 








685 








23420 


Thr 


Leu 


Lys 


Gin 


Ala 


Lys 


Arg 


Met 


He 


Glu Gin 


Lys 


Ser 


Asn 


Glu 


Val 


23421 




690 










695 








700 










23422 


Trp 


Glu 


Cys 


Leu 


Gin 


Glu 


He 


Thr Glu Gly Tyr 


Pro 


Val 


Leu 


Leu 


Asn 


23423 


705 










710 








715 










720 


23424 


Arg 


Ala 


Pro 


Thr 


Leu 


His 


Lys 


Gin 


Ser 


He Gin 


Ala 


Phe 


His 


Pro 


Lys 


23425 










725 










730 








735 


23426 


Leu 


He 


Asp 


Gly 


Lys 


Ala 


lie 


Gin 


Leu 


His Pro 


Leu 


Val 


Cys 


Ser 


Ala 


23427 








740 










745 








750 






23428 


Phe 


Asn 


Ala 


Asp 


Phe 


Asp 


Gly Asp Gin Met Ala Val 


His 


Val 


Pro 


Leu 


23429 






755 










760 








765 








23430 


Ser 


Gin 


Glu 


Ala 


He 


Ala 


Glu 


Cys 


Lys 


Val Leu 


Met 


Leu 


Ser 


Ser 


Met 


23431 




770 










775 








780 










23432 


Asn 


He 


Leu 


Leu 


Pro 


Ala 


Ser 


Gly Lys 


Ala Val 


Ala 


He 


Pro 


Ser 


Gin 


23433 


785 










790 








795 










800 


23434 


Asp 


Met 


Val 


Leu 


Gly 


Leu 


Tyr 


Tyr 


Leu 


Ser Leu 


Glu 


Lys 


Ser Gly Val 


23435 










805 










810 








815 




23436 


Lys 


Gly 


Glu 


His 


Lys 


Leu 


Phe 


Ser 


Ser 


Val Asn 


Glu 


He 


He 


Thr 


Ala 


23437 








820 










825 








830 






23438 


He 


Asp 


Thr 


Lys 


Glu 


Leu 


Asp 


He 


His 


Ala Lys 


He 


Arg 


Val 


Leu 


Asp 


23439 






835 










840 








845 






23440 


Gin 


Gly Asn 


He 


He 


Ala 


Thr 


Ser 


Ala 


Gly Arg 


Met 


He 


He 


Lys 


Ser 


23441 




850 










855 








860 








23442 


He 


Leu 


Pro 


Asp 


Phe 


He 


Pro 


Thr 


Asp 


Leu Trp 


Asn 


Arg 


Pro 


Met 


Lys 


23443 


865 










870 








875 










880 


23444 


Lys 


Lys 


Asp 


He 


Gly Val 


Leu 


Val 


Asp 


Tyr Val 


His 


Lys 


Val 


Gly Gly 


23445 










885 










890 








895 




23446 


He Gly 


He 


Thr 


Ala 


Thr 


Phe 


Leu 


Asp 


Asn Leu 


Lys 


Thr 


Leu 


Gly 


Phe 


23447 








900 










905 








910 




23448 


Arg 


Tyr 


Ala 


Thr 


Lys 


Ala 


Gly 


He 


Ser 


He Ser 


Met 


Glu 


Asp 


He 


He 


23449 






915 










920 








925 






23450 


Thr 


Pro 


Lys 


Asp 


Lys 


Gin 


L ys 


Met 


Val 


Glu Lys 


Ala 


Lys 


Val 


Glu 


Val 


23451 




930 










935 








940 








23452 


Lys 


Lys 


He 


Gin 


Gin 


Gin 


Tyr Asp Gin Gly Leu 


Leu 


Thr 


Asp 


Gin 


Glu 


23453 


945 










950 








955 








960 


23454 


Arg 


Tyr 


Asn 


Lys 


He 


He 


Asp 


Thr 


Trp 


Thr Glu 


Val 


Asn 


Asp 


Lys 


Met 


23455 










965 










970 








975 




23456 


Ser 


Lys 


Glu 


Met 


Met 


Thr 


Ala 


He 


Ala 


Gin Asp 


Lys 


Glu 


Gly 


Phe 


Asn 


23457 








980 










985 








990 






23458 


Ser 


He 


Tyr Met 


Met 


Ala 


Asp 


Ser 


Gly Ala Arg 


Gly 


Ser 


Ala 


Ala 


Gin 


23459 






995 








1000 






1005 








23460 


He 


Arg 


Gin 


Leu 


Ser 


Ala 


Met 


Arg 


Gly Leu Met 


Thr 


Lys 


Pro 


Asp 


Gly 


23461 


1010 








1015 






1020 








23462 


Spas. 


He 


He 


Glu 


Thr 


Pro 


He 


He 


Ser 


Asn Phe 


Lys 


Glu 


Gly 


Leu 


Asn 




1030 



1035 



1040 
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234 64 Val Leu Glu Tyr Phe Asn Ser Thr His Gly Ala Arg Lys Gly Leu Ala 

23465 1045 1050 1055 

23466 Asp Thr Ala Leu Lys Th^ Ala Asn Ala Gly Tyr Leu Thr Arg Lys Leu 

23467 1060 1065 1070 

23468 He Asp Val Ser Gin Asn Val Lys Val Val Ser Asp Asp Cys Gly Thr 

23469 1075 1080 1085 

23470 His Glu Gly He Glu He Thr Asp He Ala Val Gly Ser Glu Leu He 

23471 1090 1095 1100 

23472 (pn**Pro Leu Glu Glu Arg He Phe Gly Arg Val Leu Leu Glu Asp Val 
E--> 23473(^10^ 1110 1115 1120 

23474 He Asp Pro He Thr Asn Glu He Leu Leu Tyr Ala Asp Thr Leu He 

23475 1125 1130 1135 

23476 Asp Glu Glu Gly Ala Lys Lys Val Val Glu Ala Gly He Lys Ser He s\ ji 

1140 1145 1150 /J^<&-?0<s 

23478 Thr He Arg Thr Pro Val Thr Cys Lys Ala Pro Lys Gly Val Cys Ala 

23479 1155 1160 1165 

23480 Lys Cys Tyr Gly Leu Asn Leu Gly Glu Gly Lys Met Ser Tyr Pro Gly 

23481 1170 1175 1180 

23482 >^2jT\Ala Val Gly Val Val Ala Ala Gin Ser He Gly Glu Pro Gly Thr 
E--> 23483kOA5^ 1190 1195 1200 

23484 Gin Leu Thr Leu Arg Thr Phe His Val Gly Gly Thr Ala Ser Arg Ser 

23485 1205 1210 1215 

23486 Gin Asp Glu Arg Glu He Val Ala Ser Lys Glu Gly Phe Val Arg Phe 

23487 1220 1225 1230 

23488 Tyr Asn Leu Arg Thr Tyr Thr Asn Lys Glu Gly Lys Asn He He Ala 

23489 1235 1240 1245 

23490 Asn Arg Arg Asn Ala Ser He Leu Val Val Glu Pro Lys He Lys Ala 

23491 1250 1255 1260 

23492 /Pr©) Phe Asp Gly Glu Leu Arg He Glu Thr Val Tyr Glu Glu Val Val 
E--> 23493 Kj&X 1270 1275 1280 

23494 Val Ser Val Lys Asn Gly Asp Gin Glu Ala Lys Phe Val Leu Arg Arg 

23495 1285 1290 1295 

23496 Ser Asp He Val Lys Pro Ser Glu Leu Ala Gly Val Gly Gly Lys He 

23497 1300 1305 1310 

23498 Glu Gly Lys Val Tyr Leu Pro Tyr Ala Ser Gly His Lys Val His Lys 

23499 1315 1320 1325 

23500 Gly Gly Ser He Ala Asp He He Gin Glu Gly Trp Asn Val Pro Asn 

23501 1330 1335 1340 

23502 /-Arg^Cle Pro Tyr Ala Ser Glu Leu Leu Val Lys Asp Asn Asp Pro He 
E--> 23503 1350 1355 1360 

23504 Ala. Gin Asp Val Tyr Ala Lys Glu Lys Gly Val He Lys Tyr Tyr Val 

23505 1365 1370 1375 

23506 Leu Glu Ala Asn His Leu Glu Arg Thr His Gly He Lys Lys Gly Asp 

23507 1380 1385 1390 

23508 Met Val Ser Glu Lys Gly Leu Phe Ala Val He Ala Asp Asp Asn Gly 

23509 1395 1400 1405 

23510 Arg Glu Ala Ala Arg His Tyr He Ala Arg Gly Ser Glu He Leu lie 

23511 1410 1415 1420 

23512 Asp Asp Asn Ser Glu Val Ser Thr Asn Ser Val He Ser Lys Pro Thr 
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E--> 



E--> 23513 

23514 
23515 
23516 
23517 
23518 
23519 
23520 
23521 
23522 
23523 
23524 
23525 
23526 
23527 
23528 
23529 
23530 
23531 
23532 
23533 
23534 
23535 
23536 
23537 
23538 
23539 
23540 
23541 
23542 
23543 
23544 
23545 
23546 
23547 
23548 
23549 
23550 
23551 
23552 
E--> 23553 
23554 
23555 
23556 
23557 
23558 
23559 



E--> 



Input Set : A:\06132.041001 sequence listing.txt 
Output Set: N:\CRF3\07032001\I881752.raw 

i^2p 1430 1435 1440 

Thr Asn Thr Phe Lys Thr He Ala Thr Trp Asp Pro Tyr Asn Thr Pro 

1445 1450 1455 

He He Ala Asp Phe Lys Gly Lys Val Gly Phe Val Asp Val He Ala 

1460 1465 1470 

Gly Val Thr Val Ala Glu Lys Glu Asp Glu Asn Thr Gly He Thr Ser 

1475 1480 1485 

Leu Val Val Asn Asp Tyr He Pro Ser Gly Tyr Lys Pro Ser Leu Phe 

1490 1495 1500 

L#* Glu Gly Ala Asn Gly Glu Glu Met Arg Tyr Phe Leu Glu Pro Lys 

1510 1515 1520 

Thr Ser He Ala He Ser Asp Gly Ser Ser Val Glu Gin Ala Glu Val 

1525 1530 1535 

Leu Ala Lys lie Pro Lys Ala Thr Val Lys Ser Arg Asp He Thr Gly 

1540 1545 1550 

Gly Leu Pro Arg Val Ser Glu Leu Phe Glu Ala Arg Lys Pro Lys Pro 

1555 1560 1565 

Lys Asp Val Ala He Leu Ser Glu Val Asp Gly He Val Ser Phe Gly 

1570 1575 1580 

Jys^pro He Arg Asn Lys Glu His He He Val Thr Ser Lys Asp Gly 
0*2/ 1590 1595 1600 

Arg Ser Met Asp Tyr Phe Val Asp Lys Gly Lys Gin He Leu Val His 

1605 1610 1615 

Ala Asp Glu Phe Val His Ala Gly Glu Ala Met Thr Asp Gly Val He 

1620 1625 1630 

Ser Ser His Asp He Leu Arg He Ser Gly Glu Lys Glu Leu Tyr Lys 

1635 1640 1645 

Tyr He Val Ser Glu Val Gin Gin Val Tyr Arg Arg Gin Gly Val Ser 
1650 1655 1660 

^Te Ala Asp Lys His He Glu He He Val Ser Gin Met Leu Arg Gin 
\$$? 1670 1675 1680 

Val Arg He Leu Asp Ser Gly Asp Ser Lys Phe He Glu Gly Asp Leu 

1685 1690 1695 

Val Ser Lys Lys Leu Phe Lys Glu Glu Asn Ala Arg Val He Ala Leu 

1700 1705 1710 

Lys Gly Glu Pro Ala He Ala Glu Pro Val Leu Leu Gly He Thr Arg 

1715 1720 1725 

Ala Ala He Gly Ser Asp Ser He He Ser Ala Ala Ser Phe Gin Glu 
X^7 30 1735 1740 

^hr/Thr Lys Val Leu Thr Glu Ala Ser lie Ala Met Lys Lys Asp Phe 

1750 1755 1760 

Leu Glu Asp Leu Lys Glu Asn Val Val Leu Gly Arg Met lie Pro Val 

1765 1770 1775 

Gly Thr Gly Met Tyr Lys Asn Lys Lys He Val Leu Arg Ala Leu Glu 

1780 1785 1790 

Asp Asn Ser Lys Phe 
1795 
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L 
L 
L 
L 
L 
L 
L 
L 
L 
M 
L 
L 
L 
L 
L 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
M: 
L: 
M: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
M: 



:3 M:220 C: Keyword misspelled or invalid format, 
:7 M:220 C: Keyword misspelled or invalid format, 
:28 M:220 C: Keyword misspelled or invalid format, 
:29 M:220 C: Keyword misspelled or invalid format, 



[(1) GENERAL INFORMATION 
[(ii) TITLE OF INVENTION 
APPLICATION NUMBER 
FILING DATE 



:1332 
:1358 
:1522 
:1522 
:1522 



M:341 
M:341 
M:254 
M:320 
M:lll 



[(A) 
[(B) 



:254 Repeated in SeqNo=21 



(46) "n" or "Xaa" used, for SEQ ID# : 19 

(46) "n" or "Xaa" used, for SEQ ID#:20 

No. of Bases conflict, Input: 575 Counted: 1815 SEQ: 21 
(1) Wrong Nucleic Acid Designator, 20 

(47) String data converted to upper case, 



] 



: 152.6 
:1530 
:1534 
:1538 
:1542 
:1546 
:1549 
:2276 
:2308 
:5730 
:5780 
:6526 
:6585 
:8330 
:9001 
:9027 
:9078 
9206 
10283 



M:336 
M:336 
M:336 
M: 336 
M:336 
M:336 
M:204 
M: 341 
M:341 
M: 341 
M: 341 
M:341 
M:341 
M: 333 
M:341 
M:341 
M:341 
M:341 
M:332 



W: 
W: 
W: 
W: 
W: 
W: 
E: 
W: 
W: 
W: 
W: 
W: 
W: 
E: 
W: 
W: 
W: 
W: 
E: 



Invalid Amino Acid Number in Coding Region, 

Invalid Amino Acid Number in Coding Region, 

Invalid Amino Acid Number in Coding Region, 

Invalid Amino Acid Number in Coding Region, 

Invalid Amino Acid Number in Coding Region, 

Invalid Amino Acid Number in Coding Region, 



SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 



ID:21 
ID:21 
ID:21 
ID:21 
ID:21 
ID: 21 



No. of Bases differ, LENGTH : Input : 2169 Counted: 2217 SEQ: 21 



for SEQ ID#:29 
for SEQ ID#:30 
for SEQ ID#:77 
for SEQ ID#:78 
for SEQ ID#:85 
for SEQ ID#:86 



(46) "n" or "Xaa" used, 
(46) "n" or "Xaa" used, 
(46) "n" or "Xaa" used, 
(46) "n" or "Xaa" used, 
(46) "n" or "Xaa" used, 
(46) "n" or "Xaa" used, 
Wrong sequence grouping, Nucleotides not in groups! 
(46) "n" or "Xaa" used, for SEQ ID#:125 
(46) "n" or "Xaa" used, for SEQ ID#:126 
(46) "n" or "Xaa" used, for SEQ ID#:127 
(46) "n" or "Xaa" used, for SEQ ID#:128 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 138 



332 Repeated in SeqNo-138 
11815 M:332 E: (32) Invalid/Missing Amino Acid Numbering, 
332 Repeated in SeqNo=150 



SEQ ID: 150 



15906 
16012 
19424 
19498 
22418 
22492 
22888 
23344 
23463 



341 
341 
341 
341 



W: 
W: 
W: 
W: 



341 W: 
341 W: 
M:341 W: 
M:341 W: 
M:332 E: 



332 Repeated in SeqNo=308 



(46) "n" or "Xaa" used, for SEQ ID#:203 

(46) "n" or "Xaa" used, for SEQ ID#:204 

(46) "n" or "Xaa" used, for SEQ ID#:255 

(46) "n" or "Xaa" used, for SEQ ID#:256 

(46) "n" or "Xaa" used, for SEQ ID#:299 

(46) "n" or "Xaa" used, for SEQ ID#:300 

(46) "n" or "Xaa" used, for SEQ ID#:307 

(46) "n" or "Xaa" used, for SEQ ID#:308 

(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 308 
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